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Abstract — In this paper, a two-hop communication system with 
energy harvesting nodes is considered. Unlike battery powered 
wireless nodes, both the source and the relay are able to harvest 
energy from environment during communication, therefore, both 
data and energy causality over the two hops need to be considered. 
Assuming both nodes know the harvested energies in advance, 
properties of optimal transmission policies to maximize the 
delivered data by a given deadline are identified. Using these 
properties, optimal power allocation and transmission schedule 
for the case in which both nodes harvest two energy packets is 
developed. 

I. Introduction 

Recent advances in energy harvesting devices allow wireless 
sensor networks to operate in a self-powered fashion for ex- 
tended periods of time. However, due to size of sensors and low 
ambient energy sources, harvested energy from environment 
is low |[T]. Hence, efficient utilization of available energy 
is essential to increase lifetime of sensor networks as well 
as to obtain maximum amount of information from sensors. 
Optimal transmission polices for energy harvesting nodes have 
recently attracted significant interest ||2l- |fT3l . Sharma et al. 
121 and Castiglione et al. [iSj investigate optimal policies for 
stochastic energy arrivals. While ||2l gives optimal policies 
under data queue constraints, f3l generalizes the framework 
to study power allocation for both source acquisition and 
transmission under distortion constraints. Other hne of work, 
such as Bl- lfTSl studies off-line transmission policies, where 
the harvested energies are known in a non-causal fashion at the 
transmitters. The scenarios investigated include single link ||4|, 
single link with varying channel SNR (5], single link fading 
channel ||6]|, multiple access channel 17j|, broadcast channel 
lISl . orthogonal half-duplex relay channel ||9l, and two-hop ifTOl 
networks. Similarly, single energy harvesting link under battery 
imperfections is investigated in ifTTl and lfT2l . The non-causal 
knowledge of the energy harvesting process may be applicable 
for predictable energy models (such as fT3l), or more generally 
allows one to obtain upper bounds of performance. 

In this paper, we focus on a two-hop energy harvesting 
wireless network with a half-duplex relay. Since one of the 
main sources of energy consumption of a node is the power 
amplifier lfT4l . we assume that harvested energies are only 
used for transmission purposes. We also assume that both 
the source and the relay know the harvested energies in 
advance, and that energies arrive with arbitrary amounts at 
arbitrary times. Moreover, both the source and the relay have 
infinite size battery and data buffer Our goal is to find an 
optimal transmission policy (which includes scheduling and 
power allocation), subject to energy causality constraints at the 
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Figure 1. 
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source and the relay and data causality constraint at the relay, 
that maximizes total transmitted data from the source to the 
destination until a given deadline T. Note that ITOl also studies 
two-hop communication with a half duplex relay; however only 
single energy arrival at the source is considered, significantly 
simplifying the problem. Our framework is more general, in 
that, we identify properties of the optimal transmission policy 
for the multi-energy arrival case both at the source and the 
relay. Then, using these properties, we provide the optimal 
policy for two energy arrival case. Optimal transmission policy 
for the multi-arrival case builds upon the two-arrival solution 
will be discussed in future work. 

In the next section, we describe the general system model. 
In Section |III1 we describe some properties of the optimal 
transmission policy. In Section IV, we provide the solution for 
two energy arrivals to the source and the relay. In Section V, 
simulation results are presented. We conclude in Section VI. 

II. System Model 

We consider a two-hop communication system with energy 
harvesting source and relay as shown in Figure [T| The relay 
is half-duplex. Communication takes place until the deadline 
T. The source and the relay harvest energies with amounts 
Es^i > and Er.j > at times ts,o < ••■ < is. a/ and 
tr,o < ■■■ < tr,N, respectively. We set ts^ = ir,o = 
and ts.M = tr,N = T. Inter-arrival times of energy packets 
are denoted as r^.i = ts^i — and Trj = t^j — trj-i 

for all ts^i,trj e [0,7"] for the source and the relay, re- 
spectively. We assume that the energy arrival profiles are 
known to all nodes in the network prior to transmission. In 
addition, it is assumed that each node has an infinite size, 
ideal battery (instantly chargeable and no leakage) and they 
consume harvested energies only for transmission purposes. 
Furthermore, relay has unlimited data buffer We consider a 
non-negative, strictly concave and monotonically increasing 
power-rate function for each terminal, R=g(p), where p is 
the instantaneous power 111 51 . These properties are satisfied in 
many widely used power-rate models, for example. Shannon 
capacity g{p) — log(l + \h\'^p) for complex additive white 
Gaussian noise channel with channel gain h and unit noise 
variance. We assume that the source and the relay are allowed 
to change their data rate instantaneously by changing their 
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power, /jffj. The source-relay and the relay-destination channel 
gains hg and hr are in general different. However, since the 
amplitude of the channel gain can be normalized by scaling 
the harvested energy, without loss of generality, we assume 
that each link has unity gain and therefore characterized by 
the same power-rate function. 

The transmission schedule indicates which node is trans- 
mitting at a given time. Recall that half-duplex constraint sug- 
gests the source and the relay cannot transmit simultaneously. 
Epoch i of the source denotes the transmission period of the 
source when it is continuously scheduled for the z'th time. 
Duration of the epoch i is denoted by ^s,i- Similarly, S^r.i 
denotes the duration of ith epoch of the relay. The power 
allocation functions of the source and the relay depend on 
harvested energies and are represented by Ps{t) and Pr{t), 
respectively. Due to half-duplex constraint Ps{t)pr{t) = 
and the transmission schedule can be inferred from the power 
allocation functions Ps{t) and Pr{t). The total transmitted data 
by the source and the relay up to time t can be expressed 
as Bsit) = J^g{ps{T))dT and Brit) g{pr{T))dT, 
while the consumed energies are Es{t) = jQPs{T)dT and 
Er{t) ~ Pr{T)dT. A transmission policy refers to the pair 
of power allocation functions {ps{t),pr{t)). 

A transmission policy should satisfy energy and data causal- 
ity constraints in order to be feasible. The feasible set of 
transmission policies can be represented as 

S = {Ps{t),Pr{t)\Ps{t)>0,Pr{t)>0,Ps{t)Pr{t)=0, 

Es{t)< E,^,,Er{t)< 

i:0<ts,i<t i:0<tr,i<t 

BAt)<B,{t),te[0,T]} 

Our goal is to maximize the throughput as in OT], which is 
equivalent to maximizing the total transmitted data from the 
source to the destination until the deadline T. The correspond- 
ing optimization problem can be formulated as 

max Br{T) subject to {j>s{t)iPr{i)) e S" (1) 

In the next section, we will provide some basic properties 
of the optimal transmission policy which will then be used in 
Section |IV] to derive an optimal policy for M ^ N ^ 2. 

III. Properties of Optimal Transmission Policy 

Lemma 1: In a single epoch, constant power transmission 
is optimal between energy harvests. 

Proof: This follows from concavity of g(p) and by 
Jensen's inequaUty llT6l : an explicit proof is given in ||4l 
Lemma 2] for single link system. ■ 

Remark 3.1: Lemma 1 suggests that without loss of gen- 
erality we can consider power allocation functions Ps{t) and 
Pr{t) that consist of a sequence of nonzero constant power 
levels ps.i and pr.j, i = 1, m, j = 1, n, respectively. We 
assume that the power level ps.i {pr.j) has duration Zs,i {Ir.j)- 

Lemma 2: It is optimal to deplete all batteries by the 
deadline T. 

Proof: This is proved in ifTTl Lemma 5] for a single energy 
harvesting link with finite capacity battery. The proof can be 
simply extended to our model. ■ 
The following remarks will be useful in proving the subse- 
quent lemmas. 



Remark 3.2: Given any feasible transmission policy, delay- 
ing an epoch of the relay does not violate feasibility (provided 
half-duplex constraint is still satisfied) because postponing the 
relay transmission allows it to store more energy. Similarly, 
moving an epoch of the source to an earlier time (subject to 
half-duplex and source energy causality constraints) does not 
violate data causality. Moreover, the above argument still holds 
if an epoch of the relay is delayed to switch with a later epoch 
of the source. 

Remark 3.3: Consider an optimal transmission policy. We 
can obtain another feasible policy by moving all source epochs 
to earlier time instants (possibly by delaying relay epochs) 
provided energy causality of the source is maintained. Using 
Remark 13.21 we can argue that the new transmission policy 
maintains optimality. Hence, we will restrict attention to poli- 
cies for which the source transmits whenever it has energy. 

Lemma 3: Given any feasible transmission policy for which 
Ps{t) = Pr{t) = for some t e [0,r], we can find another 
feasible policy such that it transmits at least as much data by 
fully utilizing channel, that is never turning off both the source 
and the relay. 

Proof: This is argued in lITSl for a single link using 
properties of g{p). To extend it to the two-hop case we define 
an idle time as a period for which ps (t) = p^ (t) = 0. For any 
feasible transmission policy, if we have an idle time right after 
an epoch of the source, we can invoke arguments similar to 
ifTSl to extend the epoch of the source and ensure that there 
is no idle time. Note that this strategy continues to satisfy 
energy and data causality constraints. On the other hand, if 
an idle time occurs after an epoch of the relay preceded by a 
source epoch, the relay epoch can de delayed without violating 
feasibility (see Remark [l!2] ). and the previous argument can be 
used to extend the source epoch to result in no idle time. For 
the case of multiple consecutive relay epochs this argument 
can be applied repeatedly to each epoch. ■ 
Corollary 3.1: In an optimal transmission policy, the fol- 
lowing properties are satisfied: 

1) Epoches that belong to same node are not adjacent, that 
is, each epoch of the source is followed by an epoch of 
the relay. 

2) Source and relay batteries cannot be empty simultane- 
ously for any is [0,T]. 

3) A new epoch of the source starts immediately after the 
data buffer of the relay becomes empty. 

Lemma 4: In an optimal transmission policy, source and 
relay power levels form a non-decreasing sequence in time 

that is Ps^i < Ps.2 < ■■■ < Ps.m and Pr.l < Pr,2 < ■■■ < Pr,n- 

Proof: First, we consider source power levels within an 
epoch. For any transmission policy, keeping the consumed 
energy and duration of each source epoch the same, the source- 
relay communication in that epoch can be thought of as a 
single energy harvesting link. Using ||4] Lemma 1] where 
transmission time minimization problem is considered, we can 
argue that by having monotonically increasing power levels in 
that epoch, the duration of the epoch can be decreased while 
the amount of data transmitted and total energy consumed in 
that epoch remains the same. This, however, leads to an idle 
time, which can be removed by Lemma [3] The same argument 
is valid for the relay as well. 



3 



Next, we consider source power levels in successive epochs. 
Consider power levels ps.i and Ps.i+i in two successive epochs 
of the source (recall that zero power levels are not consid- 
ered in Ps,i)- Assume ps.i > Ps.i+i- Then, by keeping Ig.i 
and ls,i+i the same and equalizing power levels, the source 
transmits strictly more total data due to strict concavity of 
gip). However, the transmitted data in the first epoch decreases 
due to decrease in ps,i. Therefore, the new policy may violate 
data causality in the following relay epoch. By delaying the 
relay epoch (Remark I3.2l i. we can find another transmission 
policy which does not violate data causality. Similarly, for 
two successive relay epochs assuming pr.i > Pr,i+i, the relay 
transmits strictly more data by equalizing power levels. Unlike 
source, decrement of transmitted data in the first epoch is 
always feasible. However, increase in the total transmitted data 
by the relay may violate data causality in the following epochs. 
In that case, the new policy can be further replaced by another 
one which is obtained by increasing duration of preceding 
source epoch while keeping consumed energy the same and 
decreasing duration of the first relay epoch while keeping 
power levels equal. Clearly, the new policy transmits strictly 
more data. Finally note that after equalizing power levels of 
successive epochs, if necessary, power levels in each epoch can 
be further arranged to ensure that power does not decrease. ■ 

Lemma 5: Any feasible transmission policy can be replaced 
with one that satisfies the following property without de- 
creasing the data rate: Within an epoch of source (relay), 
whenever the source (relay) power changes, total consumed 
source (relay) energy up to that point becomes equal to total 
harvested energy. 

Proof: Suppose for a feasible transmission policy, the 
statement of the lemma does not hold for a particular source 
epoch. Then by ||4|, the policy can be replaced by another 
one which transmits the same amount of data consuming the 
same energy but in a shorter epoch duration. Note that by ID 
Lemma 3] this new policy satisfies the property specified in the 
Lemma. Then by Lemma |3] this policy can be further replaced 
by another one which transmits at least the same amount of 
data and leaves no idle time. The same argument is valid for 
the relay as well. ■ 

Lemma 6: Any feasible transmission policy can be replaced 
with one that satisfies the following property without decreas- 
ing the data rate; There is a relay epoch between source power 
levels ps,i < Ps,i+i provided there is nonzero energy in relay 
battery just before the time that source power level changes. 

Proof: Without loss of generality, consider an epoch of the 
source in which the statement of the Lemma does not hold. 
Also assume this epoch has two power levels ps^i < Ps.i+i 
such that total consumed source energy up to the time that 
the power level changes equal to total harvested source energy 
(Lemma |5]). By keeping consumed energy for the duration of 
each power level (/s,i and ls,i+i) the same, the policy can be 
replaced by another one which has power levels p'^ j, p'^ ^j^-^ 
of durations l'^ ^ = Is^i - e, = ls,i+i + e with p^^i < 

P's i < Ps i+1 < Ps,i+i and relay transmitting for e duration 
in between these power levels provided it has energy. Relay 
transmission is obtained by moving e portion of the following 
relay epoch earlier. Since relay has nonzero data in its data 
buffer, the new policy remains feasible. In the new policy. 



difference between the first and second power levels p'^ ^ and 
p'^ decreases without changing total transmission duration 
of the nodes, hence, the source transmits more data due to 
strict concavity of g{p). ■ 

Lemma 7: In an optimal transmission policy, whenever the 
relay power levels change in two successive epochs, either the 
data or the energy buffer of the relay is empty at the end of 
the first epoch. 

Proof: The proof is given by contradiction. Suppose the 
lemma is not satisfied. Consider power levels Pr^i < Pr.i+i 
in two successive epochs of relay (see Lemma ID). Then, by 
keeping Ir.i and Ir.i+i the same we can increase pr,i and 
decrease Pr.i+i unless battery or data buffer of the relay is 
empty at the end of the first epoch. As a result, the relay 
transmits strictly more data due to strict concavity of the g{p). 
This operation can be done until battery or data buffer depletes, 
or power levels equalize. However, due to increase in the total 
delivered data by the relay, the new transmission policy may 
violate data causality in the subsequent epochs. In that case, 
as in the proof of Lemma |4] this policy can be further replaced 
by another one which is obtained by decreasing duration of 
the first relay epoch, and increasing duration of the preceding 
source epoch until data causality is satisfied. Clearly, in the 
new transmission policy, the total transmitted data by the relay 
is increased compared to the initial policy. This results in a 
contradiction, hence the lemma must be true. ■ 

Lemma 8: In an optimal transmission policy, source and 
relay transmit same amount of data until the deadline. 

Proof: Consider a feasible transmission policy such that 
the source transmits more data than the relay until the deadline, 
i.e., Br{T) < Bs{T), hence total deHvered data to the 
destination is Br{T). While keeping the consumed energy 
in each epoch the same, increasing duration of an epoch 
strictly increases transmitted data in that epoch ifTSll . Therefore, 
the initial policy can be replaced by another one of higher 
rate obtained by increasing duration of relay epochs while 
decreasing duration of the following source epochs under data 
causality constraint. However, if the first epoch of the source 
transmits more data than total transmitted data in the following 
relay epochs, we need to decrease the duration of the first 
source epoch while increasing duration of the first relay epoch. 
Note that increasing duration of the first relay epoch is always 
feasible due to Er.o > 0. Combining these, we can find another 
feasible policy transmitting higher data such that source and 
relay transmit same amount of data until the deadline, i.e., 
Br{T)^Bs{T). ■ 
IV. Optimal Policy for Two Energy Arrival at the 
Source and the relay 

In this section, using the properties developed in Section 
ITm we describe the optimal transmission policy for the two 
energy arrival case, AI = N ^ 2. We use the cumulative 
curve model as in [17|. We will refer to cumulative energy 
arrival curve as the harvested energy curve lfT2l . Similarly, 
the transmitted energy curve refers to Es{t) or Er{t). By 
Remark 13.11 we express ps [t) by sequence of constant source 
power levels including zero power using the power vector 
Ps = [Ps,i, 0,ps^2, •■•] with the corresponding duration vector 
Is = [/s,i, Cr.i, •■•]■ Similarly, and 1^ can be defined. 
Note that (p^, 1^, Pr, Ir) constitutes a transmission policy. 
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Figure 2. (a) Source energy-time region and sub-regions for tlie corner point 
(ts^i, -Es,o)- (t)) Relay energy-time region and sub-regions for the corner point 
{tr,i, Erfi). Cumulative harvested energy is represented by solid lines and 
boundaries of the regions are represented by dashed lines. 



The solution of the optimization problem ([T]i depends on 
the location of the corner points {ts,i,Esfl) and (tr,i, -E^.o) 
of the harvested energy curves of the source and the relay, 
respectively, in the energy-time region. Here, tg.i and i^.i are 
the source and relay energy arrival instants, Egfl and Er^ are 
initial battery energies and the energy-time region of the source 
and the relay refer to the rectangles defined by the points 
[T.Esfi + Es^i) and (T, i?r,o + Er.i), respectively. We first 
identify two sub-regions Si and S2 of the energy-time region 
of the source as shown in Figure |2(a)| Boundaries of these 
regions are specified by dashed lines, and will be described 
subsequently. Then, based on the location of (fs,i,i?s,o) with 
respect to these sub-regions we identify regions on the energy- 
time region of the relay. If (tg^i, i?s,o) G Si, there is a single 
relay region which is the complete relay energy-time region. If 
{ts^i,Es,Q) S S2 we specify four sub-regions TZi,TZ2,TZ3,TZ4 
for the relay corner point as illustrated in Figure |2(b)| These 
sub-regions depend on the harvested energy curve of the 
source as well as (T, Er.o + Er^i). Boundaries of each region 
along with the respective optimal transmission policy will be 
described when the corresponding region is discussed. 

In the solutions described, it can be argued that obtained 
optimal power vectors and p,. are unique. The proof of 
uniqueness will be omitted due to space constraints. 

A. Source Region Si 

Source region Si is defined such that the source will be 
able to transmit with constant power in a single epoch without 
violating energy constraints (Lemma [H Remark [331 ), and the 
relay will transmit in the remaining time (Lemma [3]i. Thus, 
the problem reduces to single energy arrival for the source 
and two energy arrival for the relay, i.e., Af = 1, = 2. 
This problem is solved in ifTOl which gives a unique optimal 
source epoch length ■Cs i = ^s,i- Clearly, the source transmitted 
energy curve ^iven by the line starting from origin with slope 
Ps,i = "'"^t provides the boundary of Si because the 
source can deplete all harvested energy in a single epoch if and 
only if the corner point (ts,i, Eg.o) is above this line (Lemma 
|5]l. Therefore the condition < EiA defines region 

Si , and the algorithm in IflOl provides an optimal transmission 
policy. 

B. Source Region S2 

Whenever < the corner point {ts.i,Es.o) 

will be in region ^2. In this region, the source has two 
epochs (Lemma |6]l, hence, the relay has two epochs (Lemma 
[3] Remark I3.3l l. Also, the source has constant power levels 



Ps,i and ps 2 with lengths S^s.i and £^s.2, respectively, in these 
epochs. The optimal transmission policy further depends on 
the region relay's corner point (tr,i,^'r,o) lies in and will be 
specified below. 

1) Relay Region TZi: In this sub-region, relay's harvested 
energy does not restrict its transmitted energy curve. Therefore 
we argue that total transmitted data is equal to the case that the 
relay harvests all its total energy £",,,0 + Er,i at i = 0. Using 
Lemma [T] and Lemma |5] both the source and the relay have 
a single power level in each epoch. We first find an optimal 
transmission policy by assuming that the source harvests all its 
energy Eg^ + Eg.i at i = 0. Since this corresponds to largest 
transmitted data, if we can find a feasible policy transmitting 
this amount of data, it needs to be optimal. Solving 

9 (^^^±^) t = g ( ^-^+f -^ ) (T - t) (2) 

for t gives the optimal division of total time T between the 
source and the relay when all energies are harvested at t = 0. 
Equality in © is due to Lemma[8](Bs(r) = Br{T) = B{T)). 
To obtain a transmission policy we set 1 + = t 
where t is the solution to (2) and use power levels as in the 
solution of The transmitted data in the first epoch of the 
source and the relay are then ^B{T) and B{T), 
respectively (Remark O. If ^B{T) > ^^^^(T), 
the transmission policy is feasible. Then, the optimal trans- 
mission policy is Ps = j- g^.o-f-Bs,! ^ -E3,o+-Ea,i ^ Qj^ = 

r -^3,0 I , -Es.o J. Es^i , -Es , 1 ^1 

^B^.o+Bs,! ' £;3.o+-Es,i ' ■E3_o+-Es,i ' '''^ Bs.o+-Es,i ^' 

„ _ rn -Er,o+-Er,i n -E.-.o+-Er,i 1 1 _ r ^s-o -> -f _ 

g.r+k.i ^' g.,f+k.i ^'^"'' ~ g.,f+k,i ^]' ^h^""^ ^ is *^ so- 
lution of (|2]l. 

If above transmission policy violates data causality con- 
straint, i.e., %iB(r) < ■^T^B(T), invoking Lemma |8] 
the optimization problem in ^ can instead be written as 

max Vg i ^T^ ] is,i (3) 

s.t. g 6.. = 9 [^f^) {rs.r - 6,0 * = 1, 2(4) 

where < Ts^i = 6,2 < r^a, < E'^^ < 

Erfl + Ers and E'^_^ + E'^^^ = iJ^.o + -E'r,!- The equahty 
in dUi is due to empty data buffer at the end of the first 
relay epoch. Then, the optimal transmission policy is p^ = 

["elf "I^T'*^]' ^'^ ^ [6,1, is, 1 - 6,1, 6,2, Ts, 2 - 6,2], Pr = 
[0, 7— ;-,0, 3 ]' Ir = [6,l,is.l^6,l,6,27'''s,2~6,2]> 

where 6,i> 6,2 are the solutions to (O subject to 

Clearly, the relay transmitted energy curve given by the op- 
timal transmission policy (p^, Ir) above provides the boundary 
between TZi and the other relay regions because if (tr,i, Erfl) 
is above this curve, the policy (pr,lr) is feasible. 

2) Relay Region 712-' In this sub-region, energy causality 
is violated in the first epoch of the relay if the transmission 
policy in TZi is used. Therefore, using Remark |331 Lemma [3] 
and Lemma |5] the relay power level changes at tr,i resulting 
in two power levels pr.i and pr,2 in the first epoch of the relay, 
and a third power level pr,3 in the second relay epoch. Note 
that pr.2 and Pr,3 are different only if the relay data buffer is 
empty at the end of the first epoch (Lemma |2ll. In the optimal 
policy whether the data buffer is empty or not is not known 



5 



in advance; therefore we provide two different solutions based 
on the state of the relay data buffer When pr,2 = Pr.3, we 
maximize Q subject to 

Ell 9 (%r ) 6. = Ell 9 (^fptr) (^r, i.,) (5) 

where ^^,1 < ''r,! = tr.i^ £,s.2 < Ts.2- Note that, 1^.2 + 
lr,3 are considered together as Tr,2 — Cs,2 because Pr.2 = 
Pr-3, and = tr,i — ^s.i- The optimal policy is = 



ir,i>^s,2,Ts,2-6,2] whcrc ^s,i, ^s,2 are the solutions to (O-©. 

If the above transmission policy violates data causality, i.e., 
9 (Ps.i) 6,1 < 9 (Pr,i) {tr,i - + 5 (Pr,2) {ts.i ~ U.i) in the 
above policy, we maximize (O subject to 



9 -T^ 6,1 



is 



(6) 



Ts,2 - £.s,2 



{Ts,2 - Cs,2) 



(7) 



where ^s_i < t^.i, ^s.2 < ^^.2 and Er.i < Er^i with ij^,! 
corresponding to the consumed energy by the relay in the 
interval The first constraint in ^ is due to empty 

data buffer at the end of the first relay epoch, and the second 
constraint in (|7| is due to Lemma [8] The resulting optimal 
transmission policy is Ps = [-|^,0, -1^,0], Is = [Cs.i,^s,i — 

6,l'6,2,Ts,2-6,2], Pr = [0, tTJ^tj' t=,f-t,l'*^' rl:2-f.'^2']' 
Ir = - S.s,l,ts,l - tra,6,2,Ts,2 - 6.2] whcrC ^,,,1, 

^s,2 and Er.i are the solutions to (|3]l subject to ©-(ITli. The 
boundary between 7^2 and TZs is specified as the vertical line 
at ts.i (Remark 13.3b . 

3) Relay Region TZs: We define this sub-region as the 
region in which the relay optimal transmission policy only 
uses the first harvested energy Er^ in its first epoch and the 
second harvested energy iJ^.i in the second epoch (Remark |3.3l 
Lemma |7]i. Therefore, the relay has power levels p^.i = 

and pr,2 ~ with durations ^^.i and ^^,2 in the first and the 
second epoch of the relay, respectively. As discussed in Lemma 
111 these power levels are different since all energy Er.o is used 
in the first epoch. In the following lemma we prove that relay 
still has data in its buffer at the end of the first epoch. 

Lemma 9: In an optimal transmission policy, relay has 
nonzero data at the end of the first epoch if (tr.i, Er,o) E TZ^. 

Proof: First consider a policy of the form described above 
in which the relay consumes E^.q and has empty data buffer at 
the end of its first epoch. Let's denote corresponding optimal 
epoch durations of the source as (^j i, 2)- Now, consider the 
optimal policy in TZi . Clearly, in that policy consumed energy 
in the first epoch of the relay is larger than i?,.,o- Let's denote 
corresponding epoch durations of the source as (Cs 1 7 ?s 2 ) ■ 
Clearly, > and 2 > '?s,2 due to concavity of 
g{p)- Then, in order to make the second policy consume Er^ 
in the first epoch of the relay, we can increase first epoch 
duration ^ and decrease second epoch duration 2 of the 
source. We denote modified epoch durations of the second 
policy as (Csi7?s2)- Then, we have two feasible policies 



both consuming £"^,0 in the first relay epoch; one has source 
epoch durations (^^ 1 , 2 ) ^^ch that data buffer of the relay 
is empty at the end of the first relay epoch, the other one 
has source epoch durations (^si,^s2)- Consider the policy 
with epoch durations (Cs iiCs 1) '^^at are convex combinations 
of (^1.1:^1.2) and (^s,i,^s,2)- This policy satisfies energy 
causality and transmits strictly more data due to strict concavity 
of the objective function in ([3]). Since ^'^ ^ > £,s it is also 
feasible, and the relay has nonzero data at the end of its first 
epoch. ■ 
Using Lemma |9] we maximize (|3]l subject to 

Ell 9 (%-^) 6,. = Ell 9 (t^T^) (rs,. - 6.0 (8) 
- ts,i < 6,2 (9) 

where ^ 1 < 1, 62 < Ts2- The constraint in ^ 
is due to definition of TZ^. The optimal policy is p, = 



[0 



,0, 



[?s,l, is,l ~ S,s,l,£,s,2,Ts,2 — 6,2], Pr — 
t-l, 2 ^' ^ Ks,li is.l ^'fs.lj Cs,27 '''s,2 ^'^S.2] 



where ^.i, 6.2 are the solutions to Q subject to (l8]l-(|9]l 

4) Relay Region TZ^: In this sub-region, energy causality is 
violated in the second epoch of the relay if the transmission 
policy in TZi is used. Therefore, the relay power level changes 
at tr,i as argued in Lemma |5] Hence there is a single power 
level pr^i in the first relay epoch and there are two power levels 
Pr.2 and p,. 3 in the second epoch. Note that p^ i and ps,2 are 
different only if the relay data buffer is empty at the end of 
the first epoch (Lemma [TJ. However, whether the data buffer 
is empty or not is not known in advance. Hence we first solve 
the case that relay data buffer is not empty. Solving 

9 (^-4^) t = g (^) -t)+g (1^) r,^IO) 

for t = 6,1 + Cs.2 gives the optimal total transmis- 
sion duration of the source. We further have t^.i — 
t = Ir.i + ^r.2 because p^.i = Pr,2, and 6_2 < 
tr.i — ts.i (due to definition of TZa). The resulting op- 
timal policy is p, = [ ^°-°+^"S 0, ^°-''+^--\ 0], 1 



-E, 

Pr = [0, 
t. 



Er.o 



E, 



tr l~t 



Es.Q+Es.i ' Es.o+Es.i 

Er^ E; 

rT.2 



t 

t, Ts,2 



Es 



-^'-■P ±2^11 1 



Es 



Es.Q+Es.i 



,0 



tr.l-t 



s,l 



Es 



Es,o+E. 



— t, Tr 2] where t 



Es.a+Es.i ' Es.o+E 

is the solution of ([lOl 

If the above transmission policy is infeasible, i.e., 

9{PsA)isj < g{pr,i){ts,i - 6,1)7 then we maximize (0 
subject to 



E. 



s,0 



6,1 

Es,i 
6,2 

= 9 



6 



E, 



9 



r.O 



Cs.l 



s,l 



{tsS — 6,1) 



(11) 



6,2)+.g(f^)T,,:^12) 



where _6,i < tss, 6,2 < tr,i - is,i and Erfi < Er,o 
with Er.G corresponding to the consumed relay energy in 
interval (6,17^5,1)- The resulting optimal policy is Ps = 

[-^707 -17J7O], Is = [6,17*8,1 - 6,l7 6,2,Ts,2 - 6,2], 

E-r.a n Er-.Q — Er-.Q E^ 



Pr 



[O7: 



,0,- 



Ir 



[6,17*8 



4s, 1 ' ' tr,l-t3,l-Cs,2 ' Tr.2 - 

6,i7 6,27*r,i - - 6,27 Tr,2] whcrc 6,i7 6,2 and Er^o are 
the solutions to (O subject to (fTT])-(fT2b. 
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There is no closed form expression for the boundary between 
TZs and 72-4, therefore to decide which relay region a corner 
point {tr,i,Erfi) lies in, we check the following conditions: 

. If gij^j^JiUj - > 9{^J{T - U.il the 
corner point {tr.i,Er.o) is in TZ4. This follows from 
constraint (|9|l, that is, if (tr,i,£'r,o) G Ti-s, the total 
transmitted data in the second epoch of source would be 
more than the total transmitted data in the second epoch 
of relay violating (|9j. 

• If 5(Ff^)(*'M " < 9ij^,)iT ~ trs\ max- 

''T', 1 ,1 J- ''1", 1 

imizing (O subject to (fll])-(fl2l) cannot give a feasible 
policy because (fTTli-(fT2]i require empty data buffer at ts.i 
and tr,i — is,i > £,s,2- In this case we first solve (fTOl i. 
If the resulting poUcy is not feasible, i.e. g{ps.i)£,s,i > 
9{Pr,i)iTs,i — ^s,i), then we conclude that {tr,i,Er,o) G 
TZs. It can be argued that when (tr,i, -^^,0) G 7^3, the 
solution of ( [Tol l never results in a feasible policy. 

V. Numerical and Simulation Results 

In this section, we provide simulation results to illustrate 
average performance improvement provided by the optimal 
transmission policy developed in Section |IV] compared with 
a simple transmission policy. We set T = 1. We assume that 
for both source and relay, harvested energies are independently 
chosen from the exponential distribution with parameter A and 
tg^i and are uniformly distributed in the interval [0,1). 
Once harvested energies and harvest instances are randomly 
chosen, they are revealed to source and the relay, which can 
then carry out the corresponding transmission policies. We 
consider the Shannon power-rate function g{p) — log(l +p). 
The simple transmission policy operates in a time slotted 
fashion with slot durations -j = 0.5. In the first slot, the 
source transmits and in the second slot the relay transmits. 
In each slot, the transmission powers are chosen to maximize 
the amount of data per hop using 141, and the total transmitted 
data is the minimum of transmitted data in each hop. Figure |3] 
shows the average throughput as a function of A. This figure 
illustrates that significant throughput improvement is possible 
by employing the optimal transmission policy. 

We next fix E^.o = E^.i = Er^ = E,.,! = 5, T = 10, Us = 
8 for g{p) ~ log(l+p). Figure |4] shows the optimal throughput 
(total data transmitted/T) and the corresponding regions for the 
source/relay corner points as a function of ts,i. We observe that 
all parameters affect the location of the corner points and the 
resulting throughput. 

VI. Conclusions 

In this paper we studied energy harvesting two-hop networks 
to maximize data delivered to the destination by a given 
deadline under non-causal knowledge of the harvested energy 
profiles. We first identified properties of an optimal transmis- 
sion policy subject to energy and data causality constraints. 
Then, we provided the optimal policy for two energy arrival 
at the source and the relay. Numerical and simulation results 
clearly illustrate the benefits of employing an optimal policy. 
Details of the optimal policy for the multi-energy arrival case 
as well as online policies will be discussed in subsequent work. 
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Figure 3. Performance comparison of the optimal transmission policy with 
a simple one that operates in a time-slotted fashion. 




Figure 4. Optimal throughput and source and relay corner point regions as 
a function of t^.i for E^fi = i?s,i = -Er.o = ^r,l = 5, T = 10, = 8. 
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